Method for amplification of nucleic
acids of low complexity

The present invention relates to a method for the ampli- 
fication of nucleic acids. 

Description

This invention relates to the fields of genetic engineer- 
ing, molecular biology and computer science, and more 
specifically to the field of nucleic acid analysis based 
on specific nucleic acid amplification. 



The subject matter of the present invention is a method
for amplifying nucleic acids, such as DNA, by means of an
enzymatic amplification step, such as a polymerase chain
reaction, designed specifically for template nucleic
acids of low complexity, e.g. pre-treated DNA, such as
but not limited to DNA pre-treated with bisulfite. The
invention is based on the use of specific oligonucleotide
primer molecules to amplify only specific pieces of DNA.
It is disclosed how to optimize the primer design for a
PCR if the template DNA is of unusually low complexity.
The optimal primer design also takes into account that
the treated template DNA is single stranded.

The amplification of nucleic acids relies mainly on a
method called polymerase chain reaction (PCR). The PCR is
based on the activity of the enzyme DNA polymerase, which
elongates primer molecules bound to the template DNA by
adding dNTPs, thereby copying the template sequence
(Saiki RK, Gelfand DH, Stoeffel S, Scharf SJ, Higuchi R,
Horn T, Mullis KB and Erlich HA (1988) Primer-directed
enzymatic amplification of DNA with a thermostable DNA
polymerase. Science 239: 487-491). The primer molecules
are designed to hybridize specifically to those regions
of the template DNA that define both ends of the
amplificate. The forward primer binds to the 5' end of
the sense strand of the amplificate, whereas the reverse
primer binds to the 5' end of the reverse strand, thereby
defining the starting points of the polymerase reaction
and eventually determining the length of the amplificate.

Before the polymerase starts, the template DNA is
denatured. This is usually done in a short cycle: the
reaction mixture is heated to about 95°C, then cooled to
the annealing temperature determined by the melting
temperature of the primer molecules used, and finally the
polymerase is allowed to elongate the annealed primers at
its ideal working temperature for some minutes. This
cycle is repeated several times, each time starting with
the denaturation step. The primer molecules hybridize to
the single stranded DNA. The forward primer is the
starting molecule for a copy of the sense strand and the
reverse primer is the starting molecule for a copy of the
anti-sense strand.

These first copies will be of unspecific length, limited
only by the polymerase's activity. However, in the
following cycle, the forward primer will also bind to the
first copy of the anti-sense strand; the polymerase will
take that copy as a template and will elongate the primer
only as far as there is template DNA. The length of the
second copy is thereby limited to the length defined by
the first nucleotide of the second primer. In the
following cycles more and more pieces of template DNA
compete for the primer molecules and eventually the DNA
amplificate of defined length will be the main product.
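
The accumulation of the amplificate of defined length can be
illustrated by a simple counting model. The following is a
minimal sketch only. It assumes one double-stranded template
molecule, perfect efficiency in every cycle and unlimited
primers and nucleotides; the function name and the three
strand categories are illustrative and not part of the
method.

```python
def strand_counts(cycles):
    """Count strand types after a number of idealized PCR cycles.

    Assumptions (not from the text): one double-stranded template,
    every strand is copied once per cycle, nothing is limiting.
    Returns (original, unspecific_length, defined_length).
    """
    original, unspecific, defined = 2, 0, 0
    for _ in range(cycles):
        # A copy of an original strand starts at a primer but has no
        # defined end, so it is of unspecific length.
        new_unspecific = original
        # A copy of any primer-started strand ends where that strand's
        # own primer began, so it has the defined length.
        new_defined = unspecific + defined
        unspecific += new_unspecific
        defined += new_defined
    return original, unspecific, defined


for n in (1, 2, 3, 10):
    print(n, strand_counts(n))
```

In this model the strands of unspecific length grow only
linearly with the cycle number, whereas the strands of
defined length roughly double in every cycle from the second
cycle onwards.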






However, in the case of a bisulfite treated DNA the tem- 
plate DNA is single stranded. The bisulfite or similar 
treatment alters the original sequences on both strands 
such that these are not complementary to each other after 
the treatment. As a result no complementary strand to the 
target sequence exists. A first primer molecule binds to 
the one end of the single stranded target sequence. The 
polymerase elongates said primer and copies said target 
sequence. The second primer molecule cannot bind to the 
complementary, so called anti-sense strand, as it would 
in a standard PCR. Therefore the second primer molecule 
is designed to bind to the first copied sequence instead. 
More specifically it will bind to that part of the copied 
nucleic acid which is the complement to the other end of 
said target sequence. 
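
The loss of complementarity between the two treated strands
can be shown on a short toy sequence. The sketch below
assumes complete conversion of every cytosine (no
methylation) and models the treated strand as it is read
after amplification, i.e. with T in place of converted C.
The sequence and all names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def convert_fully(seq):
    """Read every C as T (complete conversion is assumed here)."""
    return seq.replace("C", "T")


top = "GATCCGTA"
bottom = reverse_complement(top)

top_treated = convert_fully(top)
bottom_treated = convert_fully(bottom)

# Before treatment the strands are complementary, afterwards not.
complementary_before = reverse_complement(top) == bottom
complementary_after = reverse_complement(top_treated) == bottom_treated

# The second primer therefore has to bind to the first copy of the
# treated strand, which is simply its reverse complement.
first_copy = reverse_complement(top_treated)
print(top_treated, bottom_treated, complementary_after, first_copy)
```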

The results of a PCR depend heavily on the choice of
suitable primers. The choice of a primer molecule must
respect constraints that permit correct amplification by
PCR: the hybridization temperature conditions must be
fulfilled and auto- or hetero-hybridization must be
prevented.

In other words, as any PCR requires two primer molecules
to amplify a specific piece of DNA in one reaction, the
melting temperatures of both primers need to be very
similar in order to allow proper binding of both at the
same hybridization temperature. That is why most primer
design programs require the user to define a preferred
melting temperature or a permitted range of melting
temperatures. This requirement becomes the limiting
factor when designing primers for a so called multiplex
PCR, as all primer pairs in use need to have the same or
at least very similar melting temperatures. Additionally,
primers have to be very specific, in order to amplify
only those pieces of DNA that are the target.

By providing the means for designing extremely accurate
primer pairs for DNA hybridization procedures this inven- 
tion relates to the so called PCR primer design. More 
specifically the body of this invention relates to the 
specific requirements of primers and therefore of primer 
design when using template DNA that consists of essen- 
tially only three different nucleotides and is single 
stranded. This is the case when using bisulfite treated
DNA as a template, as it contains no cytosine other than
the methylated cytosines in CG dinucleotides and a
residue of insufficiently treated and therefore
unconverted non-methylated cytosines. The invention
relates specifically to the primer design when using
bisulfite treated DNA as template.

It would be obvious to an individual skilled in the art
that the use of the primers as specified in this
invention is not limited to nucleic acid amplification.
Said primers can be used for several purposes, such as
amplification, but also for nucleic acid sequencing or as
blocking oligonucleotides during analysis of bisulfite
treated DNA. The use of said primers therefore extends to
all standard molecular biological methods.

Pairs of these primers are used to specifically amplify 
DNA from a small amount of sample DNA that consists of 
bisulfite treated DNA originating from a limited source 
of DNA like a bodily fluid or tissue sample. 

DNA can occur methylated or non-methylated at certain
positions and this information is relevant for the
status of a gene's transcription. The methyl group is
attached to the cytosine bases in CpG positions. The
identification of 5-methylcytosine in a DNA sequence as
opposed to unmethylated cytosine is of greatest
importance, for example when studying the role of DNA
methylation in tumorigenesis. However, because
5-methylcytosine behaves just like cytosine with regard
to its hybridization preference (a property relied upon
for sequence analysis), its positions cannot be
identified by a normal sequencing reaction. Furthermore,
in a PCR amplification this relevant epigenetic
information, methylated cytosine or unmethylated
cytosine, will be lost completely.

This problem is usually solved by treating the genomic
DNA with a chemical leading to a conversion of cytosine
bases, which makes it possible to differentiate the bases
afterwards.

A tool most useful for analyzing DNA methylation is the
bisulfite conversion of DNA, which converts cytosine
bases into bases with the hybridization behavior of
thymine bases. The DNA's complexity is thereby reduced
by a fourth.

Bisulfite conversion is the most frequently used method
for analyzing DNA for 5-methylcytosine. It is based upon
the specific reaction of bisulfite with cytosine which, 
upon subsequent alkaline hydrolysis, is converted to 
uracil, whereas 5-methylcytosine remains unmodified under 
these conditions (Shapiro et al. (1970) Nature 227: 
1047). However, in its base pairing behavior, uracil cor- 
responds to thymine, that is, it hybridizes to adenine; 
whereas 5-methylcytosine doesn't change its chemical 
properties under this treatment and therefore still has 
the base pairing behavior of a cytosine, that is hybrid- 
izing with guanine. Consequently, the original DNA is 
converted in such a manner that methyl-cytosine, which 
originally could not be distinguished from cytosine by 
its hybridization behavior, can now be detected as the 
only remaining cytosine using "normal" molecular
biological techniques, for example, by amplification and
hybridization or sequencing. All of these techniques are
based on base pairing which can now be fully exploited.
Comparing the sequences of the DNA prior to and after bi- 
sulfite treatment allows an easy identification of those 
bases that have been methylated. 
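
The comparison of the sequence before and after treatment
can be sketched in a few lines. This is an illustration
under stated assumptions, not a description of any
laboratory protocol: the treated sequence is modelled as it
is read after amplification (converted cytosine appears as
T), conversion of unmethylated cytosine is assumed to be
complete, and the set of methylated positions is a
hypothetical input.

```python
def bisulfite_read(seq, methylated_positions=frozenset()):
    """Sequence as read after treatment and amplification.

    Unmethylated C is read as T; C at a 0-based index listed in
    methylated_positions (hypothetical input) remains C.
    """
    return "".join(
        "T" if base == "C" and i not in methylated_positions else base
        for i, base in enumerate(seq)
    )


def methylated_sites(original, treated):
    """0-based positions at which a C survived the treatment."""
    return [i for i, (before, after) in enumerate(zip(original, treated))
            if before == "C" and after == "C"]


genomic = "ACGTCCGA"
treated = bisulfite_read(genomic, methylated_positions={1})
print(treated, methylated_sites(genomic, treated))
```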

In the scope of this invention when it says "a nucleotide 
(...) was converted by the treatment..." this conversion is 
meant to be able to differentiate between methylated and 
un-methylated cytosine bases within said sample, as for 
example the conversion of un-methylated cytosine bases to 
bases which hybridize to adenine by the treatment with 
bisulfite.

An alternative method is to use restriction enzymes that 
are capable of differentiating between methylated and un- 
methylated DNA, but this is restricted in its uses due to 
the selectivity of the restriction enzyme towards a spe- 
cific sequence. 

An overview of the further known methods of detecting 5- 
methylcytosine may be gathered from the following review 
article: Rein T, DePamphilis ML, Zorbas H, Nucleic Acids 
Res. 1998, 26, 2255. 

In terms of sensitivity, the prior art is defined by a
method which encloses the DNA to be analyzed in an
agarose matrix, thus preventing the diffusion and
renaturation of the DNA (bisulfite reacts with
single-stranded DNA only), and which replaces all
precipitation and purification steps with fast dialysis
(Olek A, Oswald J, Walter J (1996) A modified and
improved method for bisulfite based cytosine methylation
analysis. Nucleic Acids Res. 24: 5064-6). Using this
method, it is possible to analyze individual cells, which
illustrates the potential of the method.




To date, barring few exceptions (e.g., Zeschnigk M, Lich
C, Buiting K, Doerfler W, Horsthemke B (1997) A
single-tube PCR test for the diagnosis of Angelman and
Prader-Willi syndrome based on allelic methylation
differences at the SNRPN locus. Eur J Hum Genet. 5: 94-8)
the bisulfite technique is only used in research. In all
cases, however, short, specific fragments of a known gene
are amplified subsequent to a bisulfite treatment and
either completely sequenced (Olek A, Walter J (1997) The
pre-implantation ontogeny of the H19 methylation imprint.
Nat Genet. 3: 275-6), or individual cytosine positions
are detected by a primer extension reaction (Gonzalgo ML
and Jones PA (1997) Rapid quantitation of methylation
differences at specific sites using methylation-sensitive
single nucleotide primer extension (Ms-SNuPE). Nucleic
Acids Res. 25: 2529-31; WO 95/00669) or by enzymatic
digestion (Xiong Z, Laird PW (1997) COBRA: a sensitive
and quantitative DNA methylation assay. Nucleic Acids
Res. 25: 2532-4).

Another technique to detect hypermethylation is the so
called methylation specific PCR (MSP) (Herman JG, Graff
JR, Myohanen S, Nelkin BD and Baylin SB (1996)
Methylation-specific PCR: a novel PCR assay for
methylation status of CpG islands. Proc Natl Acad Sci
USA. 93: 9821-6). The technique is based on the use of
primers that differentiate between a methylated and a
non-methylated sequence if applied after bisulfite
treatment of said DNA sequence. The primer either
contains a guanine at the position corresponding to the
cytosine, in which case it will bind after bisulfite
treatment only if the position was methylated, or it
contains an adenine at the corresponding cytosine
position and therefore only binds to said DNA sequence
after bisulfite treatment if the cytosine was
unmethylated and has hence been altered by the bisulfite
treatment so that it hybridizes to adenine. With the use
of these primers, amplicons can be produced specifically
depending on the methylation status of a certain cytosine
and will as such indicate its methylation state. The
present invention, however, preferably does not include
CpGs in the primer sequence.

Another new technique is the detection of methylation via 
Taqman PCR, also known as MethylLight (WO 00/70090). With 
this technique it became feasible to determine the methy- 
lation state of single or of several positions directly 
during PCR, without having to analyze the PCR products in 
an additional step. 

In addition, detection by hybridization has also been
described (WO 99/28498).

Further publications dealing with the use of the bisul- 
fite technique for methylation detection in individual 
genes are: 

Grigg G, Clark S (1994) Sequencing 5-methylcytosine
residues in genomic DNA. Bioessays 16: 431-6; Zeschnigk
M, Schmitz B, Dittrich B, Buiting K, Horsthemke B,
Doerfler W (1997) Imprinted segments in the human genome:
different DNA methylation patterns in the
Prader-Willi/Angelman syndrome region as determined by
the genomic sequencing method. Hum Mol Genet. 6: 387-95;
Feil R, Charlton J, Bird AP, Walter J, Reik W (1994)
Methylation analysis on individual chromosomes: improved
protocol for bisulphite genomic sequencing. Nucleic Acids
Res. 22: 695-6; Martin V, Ribieras S, Song-Wang X, Rio
MC, Dante R (1995) Genomic sequencing indicates a
correlation between DNA hypomethylation in the 5' region
of the pS2 gene and its expression in human breast cancer
cell lines. Gene 157: 261-4; WO 97/46705; WO 95/15373;
WO 97/45560.

For all those methods mentioned above, which are based on
PCR amplification of bisulfite treated DNA, the biggest
challenge is to design primers that are specific.



There are a number of programs available on the market 
that offer to design primer pairs in order to amplify a 
piece of DNA in a PCR. Usually they require as input the
template DNA sequence, the preferred melting temperature 
TM, the desired length of the amplificate and optionally 
the preferred length of the primer molecules. 

However if a primer is required to bind specifically to 
bisulfite treated DNA, the design of the primer molecule 
is especially difficult and those tools known in the art 
are not competent to design primers that lead to specific 
products. The following problems occur when dealing with 
bisulfite treated DNA instead of standard DNA: 

First, the sequence complexity of the bisulfite treated
genome is reduced dramatically. Complexity in this
context is meant to be a measure of the similarity of a
given sequence to a random or stochastic sequence; the
more complex a sequence is, the more similar it is to a
random sequence. A reduced complexity of the genome means
there are fewer degrees of variation. Where there are
essentially only three different nucleotides rather than
four, the probability that a sequence occurs twice in a
given length of sequence is much higher. For example, a
primer molecule of 20 nucleotides in length is likely to
be unique in the human genome, if it is not part of a
repeat sequence: the human genome is known to consist of
about 3 x 10^9 bases. There are 4^20 ≈ 10^12 different
ways to form sequences of a length of 20 nucleotides,
assuming equidistribution of the bases, which makes
multiple occurrences of a given 20-mer (oligonucleotide
of 20 nucleotides) extremely unlikely. However, since
there are only 3^20 ≈ 3 x 10^9 different 20-mers possible
over a 3-letter alphabet, this multiple occurrence cannot
be excluded. In addition, a bisulfite treated sequence,
enriched in thymine in the sense strand and enriched in
adenine in the reverse complementary strand, will contain
more repeats and regions of general low complexity.
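
The arithmetic behind this comparison can be checked
directly. The sketch below assumes, as the text does,
equally frequent bases and a genome of about 3 x 10^9 bases;
the function name is illustrative and the model ignores
repeats and strand orientation.

```python
GENOME_SIZE = 3e9  # approximate number of bases, as stated above


def expected_occurrences(alphabet_size, length, genome_size=GENOME_SIZE):
    """Expected number of occurrences of one given oligo in a random
    sequence with equally frequent letters (a simplifying assumption)."""
    return genome_size / alphabet_size ** length


four_letter = expected_occurrences(4, 20)   # about 0.003
three_letter = expected_occurrences(3, 20)  # about 0.86

print(f"4 letters: {4 ** 20:.2e} possible 20-mers, "
      f"expected occurrences {four_letter:.4f}")
print(f"3 letters: {3 ** 20:.2e} possible 20-mers, "
      f"expected occurrences {three_letter:.2f}")
```

Under these assumptions a given 20-mer is expected far less
than once over four letters, but close to once over three
letters, which is why a second occurrence cannot be ruled
out.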

Another way to enhance or guarantee uniqueness of primer
and/or oligo molecules is to estimate their expected
frequency in the genome based upon a Markov model of
order n for the human genome or to check their uniqueness
explicitly by counting their exact occurrence. The
estimation based upon the Markov model relies upon the
determination of the probabilities of all 4^n n-mers
(oligo molecules of n nucleotides) in the human genome or
in all amplificates which are used in the hybridization
and the conditional probabilities of all four bases given
these n-mers. The primer pairs will be constructed from
forward and reverse oligos which lie within an
appropriate distance of each other and which have minimal
individual expected occurrence elsewhere in the genome.
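
One possible reading of this estimate is sketched below. It
is an assumption-laden illustration, not the implementation
used for the invention: the n-mer probabilities and the
conditional base probabilities are taken as plain relative
frequencies in a training sequence, and the expected count
is the resulting oligo probability multiplied by the number
of possible positions. The oligo is assumed to be at least n
bases long; all names and the toy sequence are hypothetical.

```python
from collections import Counter


def expected_count(oligo, genome, order):
    """Expected occurrences of oligo under a Markov model of the
    given order estimated from genome (both plain strings)."""
    n_mers = Counter(genome[i:i + order]
                     for i in range(len(genome) - order + 1))
    # n-mers that are followed by at least one more base
    followed = Counter(genome[i:i + order]
                       for i in range(len(genome) - order))
    extended = Counter(genome[i:i + order + 1]
                       for i in range(len(genome) - order))

    # probability of the leading n-mer
    prob = n_mers[oligo[:order]] / sum(n_mers.values())
    # conditional probability of each further base given its n-mer
    for i in range(order, len(oligo)):
        context = oligo[i - order:i]
        if followed[context] == 0:
            return 0.0
        prob *= extended[context + oligo[i]] / followed[context]
    return prob * (len(genome) - len(oligo) + 1)


toy_genome = "ACGT" * 50
estimate = expected_count("ACGTAC", toy_genome, order=2)
exact = sum(toy_genome.startswith("ACGTAC", i)
            for i in range(len(toy_genome)))
print(round(estimate, 2), exact)
```

For a real genome the counts would be collected once and
reused for every candidate oligo instead of being recomputed
per call.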

A second challenge in primer design for bisulfite treated
DNA is that the melting temperature TM of a bisulfite DNA
primer of a certain length is typically lower than the
melting temperature TM of a standard primer containing
cytosines. This is due to the fact that every unmethylated
cytosine in a bisulfite treated DNA is, after
amplification by PCR, replaced by thymine. Cytosine binds
its corresponding base guanine via three hydrogen bonds,
whereas thymine binds its corresponding base adenine via
two hydrogen bonds only, leading to a generally weaker
binding and thus a lower TM.
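
The size of this effect can be estimated with the Wallace
rule, a common rule of thumb that assigns 2 °C to every A or
T and 4 °C to every G or C of a short oligonucleotide. The
rule is used here only as a rough illustration; it is not
stated in this description that the invention relies on it,
and the primer sequence is hypothetical.

```python
def wallace_tm(primer):
    """Rough melting temperature in degrees C (Wallace rule):
    2 per A/T and 4 per G/C. A coarse estimate for short oligos."""
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


standard = "AGCTCCGATCGGACTCATGC"        # hypothetical 20-mer
converted = standard.replace("C", "T")   # same site, fully converted

print(wallace_tm(standard), wallace_tm(converted))
```

For this example the estimate drops from 64 to 50, i.e. by
2 °C for each cytosine that is replaced by thymine.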

A third problem arises from the fact that bisulfite
treated sequences are not only lacking cytosines but are
also thymine-rich. Thymine also hybridizes unspecifically
with guanine. This makes mismatching (unspecific binding
of a primer to a non-identical sequence) of a primer
designed for bisulfite treated DNA much more likely than
mismatching of a standard primer consisting of four
different nucleotides.

It is the aim of this invention to overcome these prob- 
lems, which are specific for primer based amplification 
of bisulfite treated DNA. 

For a so called "multiplex PCR" it becomes especially
difficult to design primer pairs. This expression is used
to describe an experiment in which several different
pieces of DNA are amplified simultaneously in one
reaction vessel. Obviously this saves a lot of effort
and time and is as such a basic requirement for high
throughput assays based on PCR amplification. An overview
of the state of the art concerning multiplex PCR is given
by Henegariu et al. (Henegariu O, Heerema NA, Dlouhy SR,
Vance GH and Vogt PH (1997) Multiplex PCR: Critical
Parameters and Step-by-Step Protocol. BioTechniques 23:
504-511), who offer a step-by-step protocol on how to
tackle multiplex PCR problems. However, the possibility
of a special primer design is not mentioned in this
article.

To ensure that the multiplex PCR works and that the
multiple products are indeed amplified, a gel
electrophoresis of the reaction mixture is usually
performed. The products are separated according to their
different sizes. Unfortunately, the ability of agarose
gel electrophoresis to distinguish the products is
somewhat limited. However, it is possible to test for
different product sizes by means of a fragment analyzer,
which is much more accurate and able to distinguish
product sizes differing by one base. Hence different
product sizes are no longer a requirement to be
considered in the primer design for a multiplex PCR.

In patent WO 01/94634 a method for a multiplex PCR using
at least two primer pairs is described that basically
consists of a two step amplification procedure wherein
one step is referred to as pre-amplification. After
pre-amplification (by means of PCR) with a number of
primer pairs, the sample is divided into as many portions
as there are primer pairs. At least one (and preferably
only one) of the previously used primer pairs is added.
This method does not relate in any way to the selection
or design of primer molecules described herein.

In an article by Shuber et al. (Shuber AP, Grondin VJ and
Klinger KW (1995) A simplified procedure for developing
multiplex PCRs. Genome Res 5(5): 488-493) regarding
multiplex PCR, the authors suggest using primers which
contain a 3' region complementary to sequence specific
recognition sites and a 5' region of a defined length of
20 nucleotides each. The authors claim that they could
establish identical reaction conditions, cycling times
and annealing temperatures for any PCR primer pair
following those requirements.

In several recent papers successful multiplex PCRs have
been established. For example, Becker et al. have
reported the development of a multiplex PCR reaction for
the detection of multiple staphylococcal enterotoxin
genes, which uses individual primer sets for each toxin
gene (Becker K, Roth R and Peters G (1998) Rapid and
specific detection of toxigenic Staphylococcus aureus:
use of two multiplex PCR enzyme immunoassays for
amplification and hybridization of staphylococcal
enterotoxin genes, exfoliative toxin genes, and toxic
shock syndrome toxin 1 gene. J. Clin. Microbiol. 36:
2548-2553). This has been developed even further by
Monday and Bohach, who increased the number of primer
pairs applied in one reaction to about 10 in order to
have one assay to amplify all of the characterized
enterotoxin genes. This still required a unique
established primer pair for the detection of every
individual gene (Monday SR and Bohach GA (1999) Use of
multiplex PCR to detect classical and newly described
pyrogenic toxin genes in staphylococcal isolates. J.
Clin. Microbiol. 37: 3411-3414).

In another paper by Sharma et al. a method for a
one-vessel multiplex PCR is described wherein each of six
chosen primer pairs consists of one identical universal
forward primer, based on a highly conserved region of the
genes of interest, and one reverse primer, specific for
each individual gene. As such the assay leads to a rapid
amplification of a family of genes which all have a
conserved region in common. It is designed to detect the
presence or absence of certain genes in an unknown
mixture. No further information is given about the primer
design, apart from saying that the primers were designed
by alignment of published DNA sequences. This is
certainly not the only requirement though, as one big
limitation of the method is the need to obtain PCR
products of different sizes in order to identify them in
the end (Sharma NK, Rees CED and Dodd CER (2000)
Development of a single-reaction multiplex PCR toxin
typing assay for Staphylococcus aureus strains. Applied
and Environmental Microbiology 66(4): 1347-1353).

In the patent application WO 01/36669 a method is
described which uses a similar approach for the
controllable amplification of a higher number of
sequences by selecting one randomly chosen reverse primer
that hybridizes unspecifically and a number of specific
forward primers to amplify a group of sequences. As the
reverse primer is labeled, all products formed will be
labeled as well. By hybridizing said amplicons to
immobilized detection oligos, which are able to
differentiate the products, it will be easy to see which
products have been amplified, and thereby the presence or
absence of said sequences in the mixture can be
determined.

The big disadvantage of all these methods is that every
primer pair needs to be established individually first to
ensure that a PCR product of the expected size is
produced and that no additional or nonspecific products
are generated. Once the specificity of the primer pairs
has been determined, PCR conditions, buffers, and primer
concentrations need to be optimized to establish
conditions under which the primer molecules can be
combined into one single PCR reaction without affecting
the ability of the primer pairs to generate a gene
specific amplicon.

A more recently published approach by Nicodeme and
Steyaert describes the conditions required for multiplex
PCR and suggests an algorithm to automatically select
primer pairs (Nicodeme P and Steyaert JM (1997) Selecting
optimal oligonucleotide primers for multiplex PCR. Proc.
Int. Conf. Intell Syst Mol Biol; 5: 210-213). In this
approach the conditions for pre-selecting primer pairs
for a successful one locus amplification (singleplex PCR
conditions) are rather broad. The three basic
requirements are the pairing distance between a forward
and a reverse primer, the condition of non-palindromicity
of a primer, and the condition that the 3' end of a
primer must not be reverse complementary to the sequence
of any of the other primers. This selection is done with
the help of a typical primer design program called
PRIMER. However, PRIMER is a two step program, and in
this approach the new method to design primers for a
multiplex PCR takes as input the output from step 1,
which is a list of possible forward and a list of
possible reverse primers for every amplificate.

The only further selection criterion for the multiplex
PCR primers is the absence of reverse complementarity of
their 3' end towards the other primer sequences in the
experiment. A second critical factor considered here is
the GC versus AT ratio. To some extent it is this ratio
that determines the melting temperature of a primer pair.
The authors suggest limiting the GC/AT ratio to a given
range, which would enable the simultaneous hybridization
of several primer pairs at one reaction temperature. The
final requirement is the electrophoresis distance,
determined by the tool that is used to differentiate the
PCR products, for example gel electrophoresis. This most
common method requires the products to be of different
sizes. The whole concept of this method also requires a
pool of possible primer pairs for each amplicon.
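
Two of the criteria named above, the 3' end complementarity
and the GC content range, lend themselves to a short
illustration. The sketch below is not the algorithm of
Nicodeme and Steyaert; the tail length of four bases, the
permitted GC fraction of 0.4 to 0.6 and all function names
are assumptions chosen for the example, and the GC fraction
is used in place of the GC/AT ratio.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def three_prime_clash(primer, other, tail=4):
    """True if the last `tail` bases of primer can pair with some
    stretch of other (both written 5'->3')."""
    return reverse_complement(primer[-tail:]) in other


def gc_fraction(primer):
    """Share of G and C among all bases of the primer."""
    return (primer.count("G") + primer.count("C")) / len(primer)


def compatible(primers, gc_range=(0.4, 0.6), tail=4):
    """Check every primer against the GC range and against the
    3' ends of all primers in the set, itself included."""
    low, high = gc_range
    for p in primers:
        if not low <= gc_fraction(p) <= high:
            return False
        if any(three_prime_clash(p, q, tail) for q in primers):
            return False
    return True


print(compatible(["GATCAGTC", "CTGGAATC"]))
print(three_prime_clash("ATGCAAGT", "GGACTTCC"))
```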

The design of suitable primers for a multiplex PCR on
bisulfite treated DNA is an even greater challenge. The
low complexity of the DNA, being reduced to essentially
three different bases rather than four, requires an
especially careful selection of primers to avoid
mismatching and unwanted amplification.

In the scope of this invention the word "mismatching"
corresponds to the situation when the alignment of two
sequences which are essentially complementary reveals
positions in one of the sequences where the nucleotide
base does not align with its corresponding base but with
a different one. The corresponding or complementary base
pairs are adenine and thymine, cytosine and guanine,
uracil and adenine. For example, a cytosine that aligns
with a thymine in its otherwise complementary sequence
creates a mismatch of one base or nucleotide.

Accordingly, "base mismatches" refers to the situation of
a base mismatching with another as explained above, and
"one or more base mismatches" refers to one or more bases
(in a given sequence) that cannot be aligned with their
corresponding bases.

Also, when the alignment reveals single nucleotide gaps 
in one of the aligned sequences this is understood under 
the term "mismatch" in the scope of this invention. 

A 'gap' is to be understood as follows: If an alignment
reveals that, in order to get the highest number of
corresponding base pairs aligned, some bases are lacking
a corresponding base in the otherwise complementary
sequence, this is called a gap. Such a gap can have a
length of one or more nucleotides.
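
The definitions above can be illustrated by counting
mismatches and gaps between a primer and a candidate binding
site. The sketch uses the classic edit distance as a
stand-in; this is an assumption, since no particular
alignment procedure is prescribed here, and in this model
every gapped base counts as one mismatch. The function names
and sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def edit_distance(a, b):
    """Minimum number of substitutions and single-base gaps needed
    to align a with b (Levenshtein distance)."""
    prev = list(range(len(b) + 1))
    for i, base_a in enumerate(a, 1):
        cur = [i]
        for j, base_b in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                        # gap in b
                           cur[j - 1] + 1,                     # gap in a
                           prev[j - 1] + (base_a != base_b)))  # (mis)match
        prev = cur
    return prev[-1]


def mismatches(primer, site):
    """Mismatches, gaps included, between a primer and a template
    site, both written 5'->3'."""
    return edit_distance(primer, site.translate(COMPLEMENT)[::-1])


print(mismatches("ACGTAC", "GTACGT"))  # fully complementary site
print(mismatches("ACGTAC", "GTATGT"))  # one base does not pair
print(mismatches("ACGTAC", "GTAGT"))   # one base lacks a partner
```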

To solve the problems mentioned above we invented a 
method consisting of several steps that is applicable for 
the amplification of nucleic acids in singleplex as well 
as in multiplex PCR experiments. 

SUMMARY OF THE INVENTION 

The method is comprised of the following steps:
Firstly, the nucleic acid sample containing the region of
interest, which is to be amplified, is isolated.
Secondly, this nucleic acid sample is treated in a manner
that differentiates between methylated and un-methylated
cytosine bases within said sample. Thirdly, a reaction
mixture is set up containing a) the treated template
nucleic acids, carrying the region of interest (also
called: target nucleic acid) that is to be amplified, b)
specified oligo-nucleotide primers, c) an enzyme capable
of amplifying said nucleic acids in a defined manner, d)
the necessary nucleotides required for the nucleic acid
synthesis and e) a suitable buffer.

Said specified oligo-nucleotide primers are characterized
in that

their sequences each reach a predefined measure of
complexity (as described in detail below),

every possible combination of two primer molecules in
said reaction mixture has a melting temperature below a
specified threshold temperature, and

none of the possible combinations of two primer molecules
in said reaction mixture leads to the amplification of an
additional unwanted product as determined by virtual
testing for amplification.

In the last step of the method said amplified target
nucleic acid is detected by means commonly used by one
skilled in the art.

The invention is composed of a method for the
amplification of nucleic acids comprising the following
steps of isolating a nucleic acid sample, treating said
sample in a manner that differentiates between methylated
and un-methylated cytosine bases within said sample,
amplifying at least one target sequence, within said
treated nucleic acid, by means of enzymatic amplification
and a set of primer molecules, wherein said primer
molecules are characterized in that

a) each primer molecule sequence reaches a predefined
measure of complexity, b) every combination of any two
primer molecules in the set has a melting temperature
below a specified threshold temperature and c) every
combination of two primer molecules, under conditions
allowing for one or more base mismatches per primer, does
not lead to the amplification of an unwanted product when
virtually tested using the treated and the untreated
sample nucleic acids as template and the last step of
detecting said amplified target nucleic acid.
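
What a virtual test for amplification under criterion c)
might look like is sketched below. It is a simplified
illustration and not the procedure of the invention: binding
sites are found by ungapped comparison with a tolerated
number of mismatches, only the template strand passed in is
searched, and the mismatch limit, the maximum product length
and all names are assumptions. For treated DNA such a search
would have to be run on each converted strand and on the
untreated sequence.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def hamming(a, b):
    """Number of differing positions of two equally long strings."""
    return sum(x != y for x, y in zip(a, b))


def sites(pattern, template, max_mismatches):
    """Start indices where pattern matches template with at most
    max_mismatches differences (no gaps)."""
    n = len(pattern)
    return [i for i in range(len(template) - n + 1)
            if hamming(pattern, template[i:i + n]) <= max_mismatches]


def virtual_pcr(primers, template, max_mismatches=1, max_length=2000):
    """Return (forward, reverse, start, end) for every predicted
    product of any ordered combination of two primers."""
    found = []
    for fwd, rev in product(primers, repeat=2):
        rev_site = reverse_complement(rev)
        for start in sites(fwd, template, max_mismatches):
            for rev_start in sites(rev_site, template, max_mismatches):
                end = rev_start + len(rev_site)
                if start + len(fwd) <= rev_start and end - start <= max_length:
                    found.append((fwd, rev, start, end))
    return found


template = "GGG" + "ACGTAC" + "T" * 10 + "GATTCA" + "CCC"
forward, reverse = "ACGTAC", reverse_complement("GATTCA")
print(virtual_pcr([forward, reverse], template, max_mismatches=0))
```

A primer set would pass such a test if the only products
reported are the intended ones.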



More detailed description of the method: 



The method is comprised of the following steps: 
In the first step of the method, the nucleic acid sample, 
which contains the region of interest that is to be am- 
plified, must be isolated from tissue or cellular 
sources. Such sources may include at least one cell, but 
usually several cells, cell lines, histological slides, 
bodily fluids, or tissue embedded in paraffin. 

In a preferred embodiment of this invention the nucleic 
acid sample is isolated from a bodily fluid, a cell cul- 
ture, a tissue sample or a combination thereof. 

For example a certain kind of organ sample from a patient 
or an animal can be used to extract genomic DNA by the 
usually applied methods. Preferably, in this invention 
DNA is extracted from a tissue sample or a biological 
fluid like blood, serum, urine or other fluids. 'Bodily 
fluid' herein refers to a mixture of macromolecules ob- 
tained from an organism. This includes, but is not lim- 
ited to, blood, blood plasma, blood serum, urine, sputum, 
ejaculate, semen, tears, sweat, saliva, lymph fluid, 
bronchial lavage, pleural effusion, peritoneal fluid, 
meningeal fluid, amniotic fluid, glandular fluid, fine
needle aspirates, nipple aspirate fluid, spinal fluid,
conjunctival fluid, vaginal fluid, duodenal juice, pancreatic
juice, bile and cerebrospinal fluid. This also
includes experimentally separated fractions of all of the
preceding. 'Bodily fluid' also includes solutions or mixtures
containing homogenized solid material, such as feces.

The nucleic acids may include DNA or RNA. Isolation may
be by means that are standard to one skilled in the art;
this includes, for example, extraction of DNA with the use
of detergent lysates, sonication and vortexing with
glass beads. An example is the extraction of DNA from a
piece of a plant, like a leaf or fruit. Once the nucleic
acids, like genomic double stranded DNA, have been
extracted they are used in the analysis.

In a preferred embodiment of this invention the nucleic 
acid sample is comprised of plasmid DNA, BACs (bacterial 
artificial chromosomes) , YACs (yeast artificial chromo- 
somes) or genomic DNA. 

In another especially preferred embodiment of this inven- 
tion the nucleic acid sample is comprised of human ge- 
nomic DNA. It is preferred that the nucleic acids are of 
human origin. 

In the second step, this nucleic acid sample is treated 
in a manner that differentiates between methylated and 
un-methylated cytosine bases within said sample. Cytosine
bases which are unmethylated at the 5'-position are
converted to uracil, thymine, or another base which is
dissimilar to cytosine in terms of hybridization behavior.
This will be understood as 'treatment' hereinafter. The
method most commonly used so far is the so-called
bisulfite treatment.

This step is essential to the process, as it translates the
methylation pattern of said nucleic acids into a sequence
pattern that is something like an imprint of the methylation
status itself. This imprint contains essentially the same
information but, unlike the methylation pattern itself, it is
not lost upon amplification via PCR. Amplification via PCR does
not differentiate between methylated and un-methylated cytosines
and therefore leads to the loss of this level of information.
The original methylation status can, however, be deduced
whenever the described pre-treatment has been performed prior
to the amplification step. Hence any means suitable to
differentiate between a methylated and an un-methylated cytosine
base are applicable, as long as the modified bases are still
capable of being amplified by enzymatic means after treatment.

It is a preferred embodiment of this invention that said 
sample is treated by means of a solution of a bisulfite, 
hydrogen sulfite or disulfite. A treatment of genomic DNA 
as described above is carried out with bisulfite (hydro- 
gen sulfite, disulfite) and subsequent alkaline hydroly- 
sis which results in a conversion of non-methylated cyto- 
sine nucleobases to uracil or to another base which is 
dissimilar to cytosine in terms of base pairing behavior. 
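
The effect of the treatment on a sequence can be illustrated in
silico. The following is a minimal sketch and not part of the
disclosed method: the function name bisulfite_convert and the
representation of methylated positions as a set of indices are
assumptions made for illustration, conversion is assumed to be
complete, and the resulting uracil is written as T, which is how
it behaves in base pairing.

```python
def bisulfite_convert(strand, methylated=frozenset()):
    """Return the strand after an idealized bisulfite treatment.

    Every cytosine whose index is not listed in `methylated` is
    replaced by T. Complete conversion is an assumption; real
    treatments only approximate it.
    """
    return "".join(
        "T" if base == "C" and i not in methylated else base
        for i, base in enumerate(strand.upper())
    )


if __name__ == "__main__":
    untreated = "ACGTCCGA"
    print(bisulfite_convert(untreated))       # no cytosine methylated
    print(bisulfite_convert(untreated, {1}))  # cytosine at index 1 methylated
```

In this sketch a methylated cytosine survives as C while all
other cytosines read as T, so the treated sequence carries the
methylation pattern as ordinary sequence information.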

In the third step of this method, a reaction mixture is 
set up containing a) the treated template nucleic acids, 
comprising the region of interest (also called target nu- 
cleic acid) that is to be amplified, b) specified oli- 
gonucleotide primers, c) an enzyme capable of amplifying 
said nucleic acids in a defined manner, for example a po- 
lymerase, d) the necessary nucleotides required for the 
nucleic acid synthesis and e) a suitable buffer. The tem- 
plate nucleic acid contains at least one target nucleic 
acid, which is amplified in the reaction. One primer 
molecule of the at least one primer pair in the reaction 
mixture is capable of binding to the 3' end of one specified
target nucleic acid. The first primer binds to the 3' end of
the target sequence; this primer is elongated and a sequence
complementary to the target sequence is made. The polymerase
stops elongating at a non-specific position. The next cycle
starts by thermally denaturing the now double stranded template
nucleic acid into single stranded template nucleic acids. This
is followed by the next phase of annealing, when both primer
molecules specifically bind to the target nucleic acid and its
complementary strand. The second primer is identical to the 5'
end of the target molecule. It does not bind to the target
sequence itself but to said nucleic acid complementary to the
target sequence, as soon as this is denatured from the
template.

The process is finished by the actual amplification phase 
at a slightly lower reaction temperature, during which 
the enzyme, for example the polymerase elongates the 
primer as a complementary sequence to the target nucleic 
acid. The polymerase elongates this second primer by us- 
ing the first copy as template until the end of said cop- 
ied nucleic acid is reached. That way an identical copy 
to the original single stranded target nucleic acid is 
created. Hence, the length of the amplificate is deter- 
mined by choosing the two primers. 

The elongation products, being complementary to each 
other and hereby building a double stranded version of 
the target nucleic acid, serve as additional targets for 
the primer molecules binding in the next cycle of ampli- 
fication. 

Essentially, step 3 of the method is comprised of amplifying
at least one target sequence, within said treated nucleic
acid, by means of enzymatic amplification and a set
of primer molecules.




Said primer molecules used in said method are character- 
ized in that they, in addition to fulfilling all the 
usual requirements towards a PCR primer as will be speci- 
fied in more detail later, also fulfill the following re- 
quirements: 

Firstly, the sequence of each primer molecule used in 
step 3 of this method reaches a predefined measure of 
complexity. 

In a preferred embodiment of this method the primer molecules
reach a certain value of linguistic complexity. A notion and a
measure of linguistic complexity was introduced by Trifonov in
1990 and has been used for the analysis of nucleotide sequences
before (Trifonov, EN (1990) Making sense of the human genome. In
Structure & Methods. Vol 1 pp 69-77 (eds. Sarma, RH and Sarma MH,
Adenine Press, Albany, US)). The linguistic complexity technique
allows a calculation to be made of the structural complexity of
any linear sequence of characters irrespective of whether the
text is cognized or presently undeciphered. The sequences are
compared exclusively from the point of view of their structural
complexity with no reference to the meaning of the texts. In
1997 Trifonov published how the linguistic complexity of
nucleosomal sequences is defined (Bolshoy, A; Shapiro, K;
Trifonov, E and Ioshikhes I. (1997) Enhancement of the
nucleosomal pattern in sequences of lower complexity. NAR 25
(16): 3248-3254). Quote: "The linguistic complexity measure
exploits the major distinguishing feature between natural
nucleotide sequences and uniformly random ones: the
repetitiveness of the natural sequences, i.e. the frequent
repetition, not necessarily a tandem one, of some
oligonucleotides ("words"), while others are avoided. (...)
Complexity can be directly calculated as the extent to which the
maximal possible vocabulary (all word sizes considered) is
utilized in a given stretch of sequence (...)."
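
The quoted definition can be sketched as a vocabulary usage
ratio. The code below is an illustrative assumption rather than
the exact formula of the cited works: published variants differ
in how the individual word sizes are combined, and here a single
ratio over all word sizes is used. The function name and the
alphabet_size parameter are hypothetical.

```python
def linguistic_complexity(seq, alphabet_size=4):
    """Fraction of the maximal possible vocabulary that `seq` uses.

    For every word size k the number of distinct words observed is
    compared with the largest number that could occur, which is
    limited both by the alphabet (alphabet_size ** k) and by the
    number of positions available (len(seq) - k + 1).
    """
    n = len(seq)
    observed = 0
    possible = 0
    for k in range(1, n + 1):
        words = {seq[i:i + k] for i in range(n - k + 1)}
        observed += len(words)
        possible += min(alphabet_size ** k, n - k + 1)
    return observed / possible if possible else 0.0


if __name__ == "__main__":
    for primer in ("ACGT", "AAAAAAAA", "TTATTTTGTTAT"):
        print(primer, round(linguistic_complexity(primer), 3))
```

A primer would then be kept only if its value reaches the
predefined threshold; the threshold itself is not specified here.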

In another preferred embodiment of this method said measure
of complexity is set by the so-called Shannon entropy
(Shannon, C E, (1948) A Mathematical Theory of Communication,
University of Illinois Press, Urbana). This is the
most common measure to assess the information content (in
a technical, non-semantic meaning) of linear information
carriers. It attributes the maximal value (which can be
chosen to be 1 without restrictions) to sequences where
all symbols (characters) occur at equal probability and a
value of 0 to sequences consisting of just one repeated
symbol (character, letter). A derived and more general
measure is the higher order Shannon entropy, which attributes
the maximal value to sequences where all subsequences
occur at equal probability and a value of 0 or
close to 0 to sequences consisting of periodic repetitions
of short subsequences. The practical determination
of the (higher order) Shannon entropy is, however, limited
by the finite lengths of sequences, which often do not
permit a precise estimation of the probability distribution
of their constitutive symbols.

Further possible measures are, for example, the Lempel-Ziv
complexity (Lempel, A and Ziv, J (1976) On the complexity
of finite sequences. IEEE Trans. Inf. Theory IT-22,
75-81), the grammar complexity (Ebeling, W; Jimenez-Montano,
MA (1980) On Grammars, Complexity and Information
Measures of Biological Macromolecules. Mathematical
Biosciences 52, 53-71), the algorithmic complexity
(Chaitin, 1990) and the conditional entropy.

Secondly, said primer molecules are also characterized in
that every possible combination of any two primer molecules
in the set has a melting temperature below a
specified threshold temperature. That way the accumulation
of dimers caused by the binding of two primer molecules
to each other in said reaction mixture is excluded.
The number of primer pairs used in that step can be any
between one and n, leading to one or n amplificates
respectively (n being a natural number).

As mentioned in the text, the word "dimer" refers to a
secondary structure formed by the hybridization of two
primer molecules to each other.

As referred to in the text, 'melting temperature' refers
to the temperature at which 50% of the nucleic acid molecules
are in duplex and 50% are denatured under standard
reaction solution conditions.
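
The pairwise dimer criterion can be sketched as follows. This is
a rough illustration under explicit assumptions: the dimer is
approximated by the longest gap-free stretch in which one primer
is complementary to the other, and its melting temperature is
estimated with the Wallace rule (2 degrees C per A/T pair, 4
degrees C per G/C pair), a coarse estimate for short duplexes
that is swapped in here for whatever melting temperature model
is actually applied. All function names and the threshold value
are hypothetical.

```python
from itertools import combinations_with_replacement

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_common_substring(a, b):
    """Longest contiguous stretch shared by a and b (dynamic programming)."""
    best = ""
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                cur[j] = prev[j - 1] + 1
                if cur[j] > len(best):
                    best = a[i - cur[j]:i]
        prev = cur
    return best


def wallace_tm(duplex):
    """Wallace rule estimate in degrees C (assumption, see lead-in)."""
    gc = duplex.count("G") + duplex.count("C")
    return 2 * (len(duplex) - gc) + 4 * gc


def dimer_tm(p1, p2):
    """Estimated Tm of the best gap-free duplex between two primers."""
    # A stretch of p1 that also occurs in revcomp(p2) can pair with p2.
    return wallace_tm(longest_common_substring(p1, revcomp(p2)))


def set_passes(primers, max_tm):
    """True if every pair, including self-pairs, stays below max_tm."""
    return all(
        dimer_tm(a, b) < max_tm
        for a, b in combinations_with_replacement(primers, 2)
    )


if __name__ == "__main__":
    print(dimer_tm("AAAAGGGG", "CCCCAAAA"))
    print(set_passes(["AAAAGGGG", "CCCCAAAA"], max_tm=20))
```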

Some primer design tools disqualify a primer if, besides 
the target sequence, a second identical sequence can be 
found in the template. However, due to the higher prob- 
ability of a bisulfite primer to mismatch with non- 
identical bisulfite treated DNA, it is an embodiment of 
this invention that only those primers are allowed to be 
used in said amplification method, for which no sequence 
homology can be found, to the extent that even those se- 
quences that are different and/or mismatching in several 
nucleotides are excluded. However, this would exclude 
primer molecules unnecessarily. Therefore they are only 
excluded if two primer molecules match to the template in 
a distance allowing for the amplification of an unwanted 
product. This test is performed by means such as, for example,
electronic PCR. Electronic PCR (e-PCR) is an in
silico virtual PCR carried out in order to assess the
suitability of primer molecules prior to in vitro PCR. In
the scope of this invention this testing will be called
'virtual testing' and it will be referred to as "virtually
tested" or "virtually testing".




Thirdly, the primers used in step 3 of this invention are 
characterized, in that every possible combination of two 
primer molecules, in said reaction mixture, does not lead 
to the amplification of an additional unwanted product, 
when virtually testing for amplification using the 
treated and the untreated nucleic acid sample as tem- 
plate, even under conditions allowing for at least one 
base but not more than 20% of the total number of bases 
per sequence mismatching per primer. In the scope of this 
invention it is to be understood that those primer mole- 
cules are considered to bind to the template for which a 
template sequence exists that is in at least 80% of its 
nucleotide sequence identical to the target sequence the 
primer originally has been designed for. For example, a
primer molecule of 50 nucleotides length is considered to
still hybridize to a template sequence that differs in
fewer than 11 nucleotides (that is, is identical in at least
80% of its nucleotide sequence) from the corresponding target
sequence. If a match is considered to be possible, it has to
be tested whether this match would lead to the amplification
of an unwanted product. This can be done with the
use of a program similar to e-PCR (see below).
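
The arithmetic of the 80% rule can be made explicit with a small
sketch. The function names are hypothetical; integer percentages
are used to avoid rounding effects, and only substitutions (no
gaps) are considered here.

```python
def max_mismatches(primer_length, min_identity_pct=80):
    """Largest number of mismatches that still leaves the required identity."""
    return primer_length * (100 - min_identity_pct) // 100


def may_bind(target_site, candidate_site, min_identity_pct=80):
    """True if the candidate site is similar enough to the intended target."""
    if len(target_site) != len(candidate_site):
        raise ValueError("sites must have equal length (gaps are not modelled)")
    mismatches = sum(a != b for a, b in zip(target_site, candidate_site))
    return mismatches <= max_mismatches(len(target_site), min_identity_pct)


if __name__ == "__main__":
    # A 50 nucleotide primer tolerates up to 10 mismatches at 80% identity.
    print(max_mismatches(50))
```

For a primer of 50 nucleotides the function returns 10, which is
the "fewer than 11 nucleotides" of the example above.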

Especially preferred is an embodiment of said method
wherein the ability of said primer molecules to amplify
an unwanted product is tested by means of in silico PCR,
taking as template nucleic acid the coding strand of the
treated sample, the non-coding strand of the treated sample
and both of the strands of the untreated sample. It
is especially preferred to perform the virtual testing
with a tool like electronic PCR on the pretreated, preferably
bisulfite treated, template sequence consisting of
the treated sense and the treated anti-sense strand, and
on the unconverted template.

Furthermore it is preferred that this treatment is bisulfite
treatment and hence the nucleic acid template is the
bisulfite converted coding strand of the human genome,
the bisulfite converted non-coding strand of the human
genome and both of the strands of the untreated human
genome. Preferred is an embodiment of said method wherein
the ability of said primer molecules to amplify an unwanted
product is tested by means of electronic PCR,
hereby taking as template nucleic acid the bisulfite
converted coding strand of the human genome, the bisulfite
converted non-coding strand of the human genome and both
of the strands of the untreated human genome.

It is preferred that the number of mismatches allowed for
when virtually testing the amplification of unwanted
products according to step 3 c) of the invention is less
than 20% of the number of nucleotides of the primer.

It is also preferred that the number of mismatches allowed
for when virtually testing the amplification of unwanted
products according to step 3 c) of the invention
is less than 10% of the number of nucleotides of the
primer.

It is especially preferred that the number of mismatches
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than 5% of the number of nucleotides of the
primer.

It is a preferred embodiment of this invention that the
number of mismatches allowed for when virtually testing
the amplification of unwanted products according to step
3 c) of the invention is less than seven.

It is especially preferred that the number of mismatches
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than five.

It is another preferred embodiment of this invention that
the number of mismatches allowed for when virtually testing
the amplification of unwanted products according to
step 3 c) of the invention is less than three.

It is especially preferred in the scope of this invention
that the number of mismatches allowed for when virtually
testing the amplification of unwanted products according
to step 3 c) of the invention is one.

It is also included in the scope of this invention to
consider such primer molecules as being sufficiently
similar to facilitate their binding to the template
sequence, for which a template sequence can be found that
differs in the number of nucleotides but is otherwise
identical to the target sequence. When the alignment of
the primer and the template sequence leads to a gap of up
to 20% of the nucleotides of one sequence, preferably of
the primer sequence, this shall still be considered to be
sufficient for binding and hence potentially leading to
the amplification of an unwanted product. Therefore these
primers also need to be tested by means of virtual
PCR (for example with a program like e-PCR). Only if this
test reveals the virtual amplification of an unwanted
product caused by the combination of two primers are the
corresponding primer pairs excluded from the set of
selected pairs.

It is preferred that the number of nucleotides creating
one gap, in one of the sequences, when aligning the primer
molecule sequence with the template sequence, allowed
for when virtually testing the amplification of unwanted
products according to step 3 c) of the invention is less
than 20% of the number of nucleotides of the primer
molecule.

It is also preferred that the number of nucleotides creating
one gap, in one of the sequences, when aligning the
primer molecule sequence with the template sequence, allowed
for when virtually testing the amplification of unwanted
products according to step 3 c) of the invention
is less than 10% of the number of nucleotides of the primer
molecule.

It is preferred that the number of nucleotides creating
one gap, in one of the sequences, when aligning the
primer molecule sequence with the template sequence, allowed
for when virtually testing the amplification of unwanted
products according to step 3 c) of the invention
is less than 5% of the number of nucleotides of the
primer molecule.

Both of these situations, mismatching due to an alternative
nucleotide or non-matching due to a missing nucleotide,
are meant to be covered in the expression describing
those primer molecules that will eventually be selected:
"said primer molecules are characterized in that
every combination of two primer molecules, under conditions
allowing for one or more base mismatches per
primer, does not lead to the amplification of an unwanted
product when virtually tested using the treated and the
untreated sample nucleic acids as template".

It is also preferred that the primer molecules that exceed
a pre-specified melting temperature when binding to
the template have to be virtually tested for amplification
of unwanted products using the treated and the untreated
sample nucleic acids as template according to step 3
c) of the method.

The basic problem of finding a primer specific enough to give
only one product on the low-complexity bisulfite DNA is finally
solved by testing each potential primer pair for hybridization
across the whole bisulfite converted human genome. This requires
translating the whole human genome sequence information virtually
into its bisulfite treated version before performing a similarity
search against the primer pairs, which is based on a method like
the so-called e-PCR (Schuler G.D. (1997) Sequence Mapping by
electronic PCR. Genome Research 7(5): 541-550). However, as the
bisulfite conversion results in two no longer complementary
strands, this virtual hybridization test needs to be done against
both bisulfite converted strands. In addition, in most cases the
template DNA is contaminated with unconverted genomic DNA. To
also exclude unwanted amplification on the unconverted DNA as
template, the same hybridization test has to be performed a third
time using the whole human genome sequence as a template.
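
The set of search templates described above can be sketched as
follows. This is an illustration under simplifying assumptions:
conversion is treated as complete and methylation is ignored, so
every C becomes T, and the function and key names are
hypothetical. Each strand is converted separately, reading it in
its own 5' to 3' direction.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def search_templates(genome):
    """Templates for virtual testing: two converted strands plus the untreated ones."""
    top = genome.upper()
    bottom = revcomp(top)
    return {
        "bisulfite_top": top.replace("C", "T"),
        "bisulfite_bottom": bottom.replace("C", "T"),
        "untreated_top": top,
        "untreated_bottom": bottom,
    }


if __name__ == "__main__":
    templates = search_templates("AACCGG")
    for name, sequence in templates.items():
        print(name, sequence)
    # After conversion the two strands are no longer complementary:
    print(revcomp(templates["bisulfite_top"]) == templates["bisulfite_bottom"])
```

Because the converted strands are no longer reverse complements
of each other, each of them has to be searched in its own right,
in addition to the untreated sequence.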

Therefore it is a preferred embodiment of this invention 
that the ability of said primer molecules to amplify an 
unwanted product is tested by means such as electronic 
PCR. 

In the last step of the method said amplified target nucleic
acid is detected by any means standard to one
skilled in the art.

In a preferred embodiment of this method the set of
primer molecules is comprised of at least two primer
molecules but not more than 64 primer molecules, given
the number is a multiple of 2; in other words, the set is
comprised of 1-32 primer pairs.

In another preferred embodiment of this method the set of 
primer molecules is comprised of between 2 and 32 primer 
molecules, given the number is a multiple of 2; in other 
words the set is comprised of 1-16 primer pairs. 

In a preferred embodiment of this method, said primer 
molecule comprises at least one nucleotide within the 
last three nucleotides from the 3' end of the molecule, 
wherein said nucleotide is complementary to a nucleotide 
of the target sequence that, as a result of the treatment 
performed in step 2) of the invention, changed its hy- 
bridization behavior. 

It is a preferred embodiment of this method, that said 
primer molecule comprises at least one nucleotide within
the last three nucleotides from its 3' end that is com- 
plementary to a nucleotide of the target sequence that 
was converted by the treatment performed in step 2 of the 
method to another base exhibiting an alternative base 
pairing behavior. 

In an especially preferred embodiment said nucleotide is 
a cytosine prior to the treatment that converts unmethy- 
lated cytosines. In a preferred embodiment said treatment 
is bisulfite treatment. Said primer molecule comprises at 
least one nucleotide within the last three nucleotides 
from the 3' end of the molecule, wherein said nucleotide 
is complementary to a cytosine, that was converted by bi- 
sulfite treatment to another base exhibiting the base 
pairing behavior of thymine. 

This is to exclude binding of said primer molecules to
the remaining untreated or insufficiently treated nucleic
acids, which might still serve as template nucleic
acid in the PCR.

Furthermore it is a preferred embodiment of this inven- 
tion that said primer molecules do not form loops or 
hairpins on their own or with each other. 

In another preferred embodiment of the method said primer 
molecules do not form dimers with each other. 

In the text the word 'hairpin' is taken to mean a secondary
structure formed by a primer molecule when the 3'
terminal region of said nucleic acid hybridizes to the 5'
terminal region of said nucleic acid, forming a double
stranded stem structure, and wherein only the central
region of the primer is single stranded.

As described in the text the word 'loop' refers to a secondary
structure formed by a primer molecule when two or
more nucleotides of said molecule hybridize thereby form- 
ing a secondary structure comprising a double stranded 
structure one or more base pairs in length and further 
comprising a single stranded region between said double 
stranded region. 

The binding of a primer molecule's 3' end to any part of a
second primer molecule in the set needs to be avoided.
Otherwise the polymerase would extend the first primer
using the second primer as template, which would lead to
a new unwanted product, an extended primer, or rather a
primer-hybrid, which would serve as the preferred template
for the next round of the polymerase chain reaction
and thereby prevent a sufficient amplification of the
wanted product.

Therefore it is another preferred embodiment of this
method that each of said primer molecules is characterized
in that at least the last 5 bases at the 3' end of
said primer molecule are not complementary to the sequence
of any other primer molecule in the set.
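
The 3' end criterion lends itself to a simple string check. The
sketch below is an illustration only; the function names are
hypothetical, and complementarity is taken to mean a perfect,
gap-free antiparallel match of the last n bases.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def three_prime_end_is_free(primer, other_primers, n=5):
    """True if the last n bases of `primer` cannot pair with any other primer.

    The 3' tail pairs with another primer exactly when the reverse
    complement of the tail occurs somewhere in that primer.
    """
    partner = revcomp(primer[-n:])
    return all(partner not in other for other in other_primers)


if __name__ == "__main__":
    primer = "ACGTTGCAAGGCT"                 # 3' tail is AGGCT
    print(three_prime_end_is_free(primer, ["TTTAGCCTTT"]))  # contains AGCCT
    print(three_prime_end_is_free(primer, ["TTTTTTTTTT"]))
```

To screen a whole set, the check would be repeated for each
primer against all remaining primers of the set.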

It is also preferred that said primer molecules do not
bind to nucleic acids which prior to the treatment of step 2
contained a 5'-CG-3' site. Binding to such a site would make
the binding of the primers to bisulfite treated nucleic acids
dependent on the methylation status of their cytosines. A
primer corresponding to CG would bind to the
treated methylated version only, whereas a primer
corresponding to TG would bind to the treated unmethylated
version of these nucleic acids only. It is therefore
preferred that said primer molecules do not contain nucleic
acid sequences complementary or identical to nucleic acid
sequences which prior to the treatment of step 2 contained a
5'-CG-3' site.
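
Whether a candidate binding site avoids such positions can be
checked on the untreated sequence. In the sketch below, which is
an illustration with hypothetical names, the site is described by
its start and length on the untreated top strand. Since the
dinucleotide CG is its own reverse complement, one scan covers
both strands. The window is widened by one base on each side as
a conservative assumption, so that a CG overlapping the site by
only one base is also detected.

```python
def site_avoids_cpg(untreated, start, length):
    """True if no 5'-CG-3' dinucleotide touches the binding site."""
    window_start = max(0, start - 1)
    window_end = start + length + 1
    return "CG" not in untreated[window_start:window_end].upper()


if __name__ == "__main__":
    untreated = "AACGTTTTTTAA"
    print(site_avoids_cpg(untreated, 4, 6))  # site TTTTTT, no CG nearby
    print(site_avoids_cpg(untreated, 3, 5))  # site starts with the G of a CG
```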

In a preferred embodiment of this method said primer 
molecules are of a specified size range. 

It is especially preferred that these primers are com- 
prised of 16-50 nucleotides. 

In a preferred embodiment of this method said primer 
molecules do not comprise sequences that are complemen- 
tary to regions of the target nucleic acids that con- 
tained specified restriction enzyme recognition sites 
prior to the treatment that altered the unmethylated
cytosines' base pairing behavior. It is preferred that said
primers are complementary to target sequences which prior
to the treatment performed in step 2 of the invention did
not contain specified restriction enzyme recognition
sites.

By selecting the right primer molecules, the amplificate's
sequence is also determined. That is why care has to be taken
to only use those primer molecules that lead to amplification
of nucleic acids containing a reasonably high number of CpG
sites to be analyzed. Due to the treatment of step 2 of this
invention these CpG sites, depending on the methylation status
of the cytosine, are converted and will therefore appear either
as CG dinucleotides or as TG dinucleotides in the amplificate.

It is preferred that said primer molecules amplify regions
of nucleic acids that prior to bisulfite treatment
comprise more than eight 5'-CG-3' sites, also referred
to as CG dinucleotides.

It is also preferred that said primer molecules amplify
regions of nucleic acids that prior to bisulfite treatment
comprise more than six 5'-CG-3' sites, also referred
to as CG dinucleotides.

It is also preferred that said primer molecules amplify
regions of nucleic acids that prior to bisulfite treatment
comprise more than four 5'-CG-3' sites, also referred
to as CG dinucleotides, and finally it is especially
preferred that said primer molecules amplify regions
of nucleic acids that prior to bisulfite treatment
comprise more than two 5'-CG-3' sites, also referred to
as CG dinucleotides.



Said primer molecules lead to amplificates within a
specified size range.

It is a preferred embodiment of this invention that said
primer molecules lead to amplificates which are comprised
of at least 50 bp but not more than 2000 bp.

Especially preferred are primer molecules that lead to
amplificates which are comprised of at least 80 bp but
not more than 1000 bp.
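
The amplificate criteria on size and CpG content can be combined
in one filter. The sketch below is illustrative only; the
function name and parameter names are hypothetical, and the
default values are taken from the broadest ranges named in the
text (50 to 2000 bp, more than two CG dinucleotides prior to
treatment).

```python
def amplificate_is_acceptable(untreated_region, min_bp=50, max_bp=2000,
                              more_than_cpg=2):
    """True if the region is within the size range and has enough CpG sites.

    `untreated_region` is the sequence of the amplified region as it
    was prior to treatment, so that CG dinucleotides can be counted.
    """
    size_ok = min_bp <= len(untreated_region) <= max_bp
    cpg_sites = untreated_region.upper().count("CG")
    return size_ok and cpg_sites > more_than_cpg


if __name__ == "__main__":
    region = "CG" * 3 + "A" * 50
    print(amplificate_is_acceptable(region))
    # Stricter embodiment: 80 to 1000 bp and more than eight CpG sites.
    print(amplificate_is_acceptable(region, 80, 1000, 8))
```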

Furthermore a method is preferred wherein said primer 
molecules lead to amplificates of treated nucleic acids 
which prior to the treatment which altered the unmethylated
cytosines' base pairing behavior did not contain
restriction enzyme recognition sites. Said primer molecules
lead to amplificates that are amplified regions of the 
treated nucleic acids which prior to the treatment per- 
formed in step 2) of the method did not contain specified 
restriction enzyme recognition sites. 

A further subject of this invention is a method on how to 
produce said primer molecules. The main step of producing 
a primer molecule is determining its sequence. In the 
following the phrase "primer design" will be used instead 
of primer production, whenever it is referred to the step 
of determining said specific primer sequences. Designing 
primer molecules is a process which as such is well known 
to scientists skilled in the art. The programs usually 
used for this purpose are such as PRIMER3 or OSP (Rozen S 
and Skaletsky H (2000) PRIMER3 on the WWW for general us- 
ers and for biologist programmers. Methods Mol Biol 132: 
365-386; Hillier L and Green P (1991) OSP: A computer 
program for choosing PCR and DNA sequencing primers. PCR 
Methods and Applications 1: 124-128). Other primer design
systems (as described in EP-A 1136932) are often based
on those commonly known programs.

An embodiment of this invention takes advantage of using
a program like PRIMER3 first, and then adds a number of
steps that finally result in an advanced method of designing
primers that are specifically useful for amplifying
sequences of low complexity.

In the first step of this method for designing specific
primer molecules for nucleic acids of low complexity,
primer pairs that amplify single products are selected by
applying standard tools of primer design known in the
art, like for example the program PRIMER3 (Rozen, S and
Skaletsky, H (2000) Methods Mol Biol 132: 365-386).

In the second step of the method it is tested, for each of
said primer pairs, whether one of its primer molecules, when
hybridizing to any other primer molecule in the set, exceeds
a specified threshold melting temperature (Tm). If
this is the case, the primer pair that comprises said
primer is excluded from the set of potentially combined
pairs.

In the third step of the method the number of previously
selected primer pairs is reduced to a smaller number by
implementing as a new criterion a measure of the primer
sequence's complexity. Primer pairs that contain a
primer molecule which does not meet said criterion are
excluded.

The basic problem of finding a primer specific enough to give
only one product on the low-complexity bisulfite DNA is finally
solved by testing each potential primer pair for hybridization
across the whole bisulfite converted human genome. This requires
translating the whole human genome sequence information virtually
(as in "in silico") into its treated, for example bisulfite
treated, version before performing a similarity search against
the primer pairs, which is based on a method like the so-called
e-PCR (Schuler G.D. (1997) Sequence Mapping by electronic PCR.
Genome Research 7(5): 541-550). However, as the bisulfite
conversion results in two different versions of the double helix
whose sense and anti-sense strands are no longer mutually
complementary, this in silico amplification needs to be performed
on both bisulfite converted versions of the genome. In addition,
in most cases the template DNA is contaminated with unconverted
genomic DNA. It cannot be excluded that single cytosines or
longer runs of DNA remain unconverted or are only converted
incompletely by the bisulfite treatment. To also exclude unwanted
amplification of the unconverted DNA as template, the same
hybridization test has to be performed a third time using the
whole human genome sequence as a template.

As this requires considerable effort and time (CPU time),
this is the fourth and last step of this design method,
which is completed prior to the final testing in a 'wet',
lab based, experiment.



In addition, to improve the specificity of said primer
molecules, the stringency of the selection criteria is
increased: Some standard primer design tools disqualify a
primer if in the template sequence a second identical
sequence, besides the target sequence, can be found. That
way mispriming at rather stringent hybridization conditions
is avoided. This mispriming would not necessarily
lead to an additional unwanted product, but would lead to
the dilution of the primer molecules available for
amplification. This selection has been performed in step
one already (for example by PRIMER3). However, due to the
higher probability of a bisulfite primer molecule to mismatch
with non-identical bisulfite treated DNA, there is
still a chance for said primer molecules to misprime even
when up to 20% of the nucleotides of the primer sequence
differ. Therefore it is claimed in this invention to only
use primer molecules for which not even a weak sequence
homology can be found. However, this would exclude primer
molecules unnecessarily. Therefore they are only excluded
if two primer molecules match to the template and amplify
an unwanted product. This test is performed by means such as,
for example, electronic PCR. Electronic PCR (e-PCR)
is an in silico virtual PCR carried out in order to assess
the suitability of primers prior to in vitro PCR.

In the fourth step of the method on how to design these 
primers it is therefore tested whether there are any re- 
gions of the template nucleic acid, said template being 
comprised of the sense and the anti-sense strand of the 
treated and the untreated nucleic acids, that are identi- 
cal in sequence with the primer molecule to more than 80% 
and if those primer molecules are able to amplify an un- 
wanted product. If this is the case, the primer pair 
comprising said primer molecule is excluded from the 
selection. 
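The test of this fourth step can be sketched as follows. The
sketch is a naive, ungapped scan and not the e-PCR program
itself; all function names are hypothetical, the 80% identity
threshold is taken from the paragraph above, and the maximum
product length of 2000 is an assumed cut-off.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def binding_sites(primer, strand, min_identity=0.8):
    """Start indices where the strand is more than min_identity
    identical to the primer (ungapped comparison)."""
    n = len(primer)
    sites = []
    for start in range(len(strand) - n + 1):
        window = strand[start:start + n]
        matches = sum(1 for a, b in zip(primer, window) if a == b)
        if matches / n > min_identity:
            sites.append(start)
    return sites


def virtual_products(primer_a, primer_b, strand,
                     min_identity=0.8, max_length=2000):
    """(start, end) spans that the two primers could amplify on one
    strand, with primer_a acting as forward primer."""
    forward = binding_sites(primer_a, strand, min_identity)
    reverse = binding_sites(reverse_complement(primer_b), strand,
                            min_identity)
    products = []
    for f in forward:
        for r in reverse:
            end = r + len(primer_b)
            if r > f and end - f <= max_length:
                products.append((f, end))
    return products


def amplifies_unwanted(primer_a, primer_b, strands, intended, **kwargs):
    """True if any product other than the intended
    (strand_name, start, end) is found on any strand."""
    for name, strand in strands.items():
        for span in virtual_products(primer_a, primer_b, strand, **kwargs):
            if (name, *span) != intended:
                return True
    return False
```

In use, `strands` would hold the sense and anti-sense strands
of both the treated and the untreated nucleic acid, and each
primer combination would be tested in both orientations.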

The template nucleic acid is comprised of the treated
template nucleic acid and the untreated template nucleic
acid. The treated nucleic acid in itself is comprised of
two strands which after treatment are no longer
complementary to each other. This virtual testing can be
performed, for example, as described by Gregory Schuler in
his article (cited above) about sequence mapping by
"Electronic PCR". The remaining primer pairs can be used
to specifically amplify regions of nucleic acids of low
complexity, which is the aim of this invention. Hence step
4 of the design method is the virtual testing of each
possible primer pair combination, under pre-specified
conditions at a stringency allowing for one or more base
pair mismatches, as to whether any unwanted nucleic acids
are amplified. Said virtual testing is carried out upon
both untreated and treated nucleic acids. The wording
"possible combinations" refers to all combinations that
are possible within a set of primer pairs to be used in
one amplification reaction vessel.
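The number of combinations to be tested grows quickly with
the size of the set. As a minimal sketch (the function name
is hypothetical, and combining a primer with itself is
deliberately left out), the combinations of any two primer
molecules in one reaction vessel can be enumerated as:

```python
from itertools import combinations


def all_primer_combinations(primer_pairs):
    """List every combination of two distinct primer molecules
    drawn from all pairs that share one reaction vessel."""
    primers = [primer for pair in primer_pairs for primer in pair]
    return list(combinations(primers, 2))


# Two pairs give four primers and therefore six combinations.
example = all_primer_combinations([("F1", "R1"), ("F2", "R2")])
```

A set of 16 primer pairs contains 32 primers and thus 496
such combinations; a set of 32 pairs contains 64 primers and
2016 combinations, each of which has to be tested against
every template strand.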

In a preferred embodiment an additional step is added
following the virtual testing: all remaining pairs are
tested in a lab-based single PCR assay to determine
whether the desired amplificate can be obtained.
If that is the case, the chosen pairs can be used to
specifically amplify those regions of nucleic acids of low
complexity according to the method as described before.

In a specially preferred embodiment the first step of the 
design method is characterized as selecting a pool of 
possible primer pairs per amplificate by means of a stan- 
dard PCR primer design program using said nucleic acids 
as template that have been masked for repeats and SNPs 
considering the following factors: 

length of amplificate, length of primer, melting tempera- 
ture of the primer molecule, dimer formation parameters, 
loop formation parameters, exclusion of unidentified or 
ambiguous nucleotides in the primer sequence, exclusion 
of restriction enzyme recognition sites. 
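Some of the factors listed above can be illustrated with a
simple filter. This is a sketch only and does not reproduce
any particular primer design program: the melting temperature
is estimated with the Wallace rule (2 °C per A or T, 4 °C per
G or C), which is a rough approximation for short
oligonucleotides, and the temperature window is an assumed
example value. The length limits of 16 and 50 nucleotides are
those named in the claims.

```python
def wallace_tm(primer):
    """Rough melting temperature estimate (Wallace rule), in deg C."""
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


def passes_basic_filters(primer, min_len=16, max_len=50,
                         min_tm=50, max_tm=65):
    """Check primer length, unambiguous nucleotides and estimated Tm.

    min_tm and max_tm are hypothetical example thresholds.
    """
    if not min_len <= len(primer) <= max_len:
        return False
    if set(primer) - set("ACGT"):
        # Unidentified or ambiguous nucleotide such as N.
        return False
    return min_tm <= wallace_tm(primer) <= max_tm
```

Dimer formation, loop formation and restriction enzyme
recognition sites are not covered by this sketch.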

In a preferred embodiment of this invention this measure
of complexity is a measure of linguistic complexity as
defined by Bolshoy et al. (see above). Those primer pairs
that comprise a primer which does not reach a set level of
this linguistic complexity are excluded from the
previously selected ones.

In another preferred embodiment of this invention this
measure of complexity is a measure of Shannon entropy (as
described before).
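Both complexity measures can be computed directly from the
primer sequence. The sketch below uses the standard Shannon
entropy over nucleotide frequencies and, for linguistic
complexity, one common formulation (the product of the
vocabulary usage for each word length). The definition
referred to earlier in this document may differ in detail, so
this is an assumption; the maximum word length of 3 is
likewise an assumed parameter.

```python
import math
from collections import Counter


def shannon_entropy(seq):
    """Shannon entropy of the nucleotide composition, in bits."""
    counts = Counter(seq)
    total = len(seq)
    return -sum((n / total) * math.log2(n / total)
                for n in counts.values())


def linguistic_complexity(seq, max_word=3):
    """Product over word lengths k of (distinct k-mers observed) /
    (maximum number of distinct k-mers possible in the sequence)."""
    score = 1.0
    for k in range(1, max_word + 1):
        positions = len(seq) - k + 1
        if positions <= 0:
            break
        observed = len({seq[i:i + k] for i in range(positions)})
        possible = min(4 ** k, positions)
        score *= observed / possible
    return score
```

A primer for fully converted DNA uses essentially three
letters, so its Shannon entropy cannot exceed log2(3), about
1.58 bits, compared with 2 bits for a four-letter sequence.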




In an especially preferred embodiment of this design
method, prior to performing step d) the additional step
of excluding from the remaining primer pairs those pairs
which comprise a primer molecule that comprises at
least one CpG site is carried out.

In an especially preferred embodiment of this method ac- 
cording to the design of said primers, prior to perform- 
ing step d) the additional step of excluding primer pairs 
from the remaining pairs when one of its primer molecules 
does not contain at least one nucleotide within the last 
three nucleotides from the 3' end of the molecule wherein 
said nucleotide is complementary to a nucleotide of the 
target sequence that was converted to a different nucleo- 
tide by bisulfite treatment, is carried out. 
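This 3' end criterion can be sketched as a check on the
untreated sequence of the primer binding site. The sketch
makes several assumptions: the primer anneals directly to the
treated strand, so that its 3' end pairs with the 5' end of
the binding site; only cytosines outside a CpG context are
counted as converted; and the function name is hypothetical.

```python
def three_prime_covers_converted_base(site_untreated, window=3):
    """Return True if one of the last `window` primer positions at the
    3' end pairs with a cytosine that bisulfite treatment converts.

    site_untreated: untreated sequence of the template segment the
    primer anneals to, written 5'->3'. The primer's 3' end pairs with
    the first bases of this segment.
    """
    site = site_untreated.upper()
    for i in range(min(window, len(site))):
        if site[i] != "C":
            continue
        next_base = site[i + 1] if i + 1 < len(site) else ""
        if next_base != "G":
            # Cytosine outside a CpG context: assumed to be converted.
            return True
    return False
```

A primer that passes this check mismatches unconverted DNA
close to its 3' end, which is where a mismatch interferes most
with elongation.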

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d) the additional step of excluding from
the remaining primer pairs those pairs which amplify a
nucleic acid that did not, prior to treatment with
bisulfite, contain a minimum of two CpG sites is carried
out.
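The two CpG-based exclusions described above (primers that
cover a CpG site, and amplificates with fewer than two CpG
sites) can be sketched on the untreated sequences. Function
names are hypothetical.

```python
def count_cpg(seq):
    """Number of 5'-CG-3' dinucleotides in an untreated sequence."""
    return seq.upper().count("CG")


def primer_site_has_cpg(site_untreated):
    """True if the untreated primer binding site contains a CpG site,
    in which case the primer pair would be excluded."""
    return count_cpg(site_untreated) > 0


def amplificate_has_enough_cpg(amplificate_untreated, minimum=2):
    """True if the untreated amplificate contains at least `minimum`
    CpG sites and is therefore kept."""
    return count_cpg(amplificate_untreated) >= minimum
```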

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d) the additional step of excluding primer
pairs from the remaining primer pairs when one of their
primer molecules contains more than 5 bases at its 3' end
that are complementary to any other primer molecule's
sequence in the set is carried out.
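A minimal sketch of this 3' end complementarity check follows.
"More than 5 bases" is read here as a run of 6 bases at the
3' end, which is an interpretation; the function names are
hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def three_prime_clash(primer, other_primers, n_bases=6):
    """True if the last n_bases at the 3' end of `primer` are
    complementary to a stretch of any other primer in the set."""
    tail = primer[-n_bases:]
    # A stretch complementary to the tail reads as its reverse
    # complement when the other primer is written 5'->3'.
    target = reverse_complement(tail)
    return any(target in other for other in other_primers)
```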

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d) the additional step of excluding from
the remaining primer pairs those pairs which comprise one
primer molecule that in combination with another primer
molecule in the set amplifies an unwanted product, when
virtually testing according to step 3 c) of the
amplification method under conditions allowing for a
number of mismatching nucleotides of 20% of the number of
nucleotides of the primer molecule, is carried out.

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d) the additional step of excluding from
the remaining primer pairs those pairs which comprise one
primer molecule that in combination with another primer
molecule in the set amplifies an unwanted product, when
virtually testing according to step 3 c) of the
amplification method under conditions allowing for a
number of nucleotides creating one gap, when aligning the
primer molecule sequence with the template sequence, of up
to 20% of the number of nucleotides of the primer
molecule, is carried out.



In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d) the additional step of excluding from
the remaining primer pairs those pairs which comprise one
primer molecule that in combination with another primer
molecule in the set amplifies an unwanted product, when
virtually testing according to step 3 c) of the
amplification method under conditions allowing for four or
fewer mismatching base pairs, is carried out.

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d) the additional step of excluding from
the remaining primer pairs those pairs which comprise one
primer molecule that in combination with another primer
molecule in the set amplifies an unwanted product, when
virtually testing according to step 3 c) of the
amplification method under conditions allowing for two or
fewer mismatching base pairs, is carried out.
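The mismatch limits in the embodiments above are given either
as a percentage of the primer length or as an absolute count.
The sketch below shows how the two relate; it covers
mismatches only (gaps are not modelled), rounds the percentage
down to a whole number of nucleotides, which is an assumption,
and uses hypothetical function names.

```python
def allowed_mismatches(primer_length, percent):
    """Largest whole number of mismatches that does not exceed
    `percent` of the primer length."""
    return primer_length * percent // 100


def mismatches(primer, site):
    """Number of differing positions between a primer and an
    equally long candidate binding site."""
    if len(primer) != len(site):
        raise ValueError("primer and site must have equal length")
    return sum(1 for a, b in zip(primer, site) if a != b)


def binds_within_budget(primer, site, percent=20):
    """True if the site differs from the primer by no more than the
    allowed number of mismatches."""
    return mismatches(primer, site) <= allowed_mismatches(len(primer),
                                                          percent)
```

For a primer of 20 nucleotides, a limit of 20% corresponds to
four mismatches and a limit of 10% to two mismatches.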




Claims 

1. A method for the amplification of nucleic acids
comprising the following steps

1) isolating a nucleic acid sample, 

2) treating said sample in a manner that differenti- 
ates between methylated and un-methylated cytosine 
bases within said sample, 

3) amplifying at least one target sequence, within 
said treated nucleic acid, by means of enzymatic am- 
plification and a set of primer molecules, wherein 
said primer molecules are characterized in that 

a) each primer molecule sequence reaches a prede- 
fined measure of complexity, 

b) every combination of any two primer molecules 
in the set has a melting temperature below a 
specified threshold temperature, 

c) every combination of two primer molecules, un- 
der conditions allowing for one or more base mis- 
matches per primer, does not lead to the amplifi- 
cation of an unwanted product when virtually 
tested using the treated and the untreated sample 
nucleic acids as template, 

and 

4) detecting said amplified target nucleic acid.

2. A method according to claim 1 wherein said primer
molecules do not contain nucleic acid sequences
complementary or identical to nucleic acid sequences of
the target sequence which prior to treatment of step
2 contained a 5'-CG-3' site.

3. A method according to claims 1 and 2 wherein said set
is comprised of at least one but not more than 32
primer pairs.

4. A method according to claims 1 and 2 wherein said set
is comprised of at least one but not more than 16
primer pairs.

5. A method according to claims 1 to 4 wherein the
primer molecules reach a specified value of
linguistic complexity.

6. A method according to claims 1 to 4 wherein the
primer molecules reach a specified value of
Shannon entropy.

7. A method according to claims 1 to 6 wherein the
nucleic acid sample is isolated from a bodily fluid, a
cell culture, a tissue sample or a combination thereof.

8. A method according to claims 1 to 7 wherein the
nucleic acid sample is comprised of plasmid DNA, BACs,
YACs or genomic DNA.

9. A method according to claims 1 to 7 wherein the
nucleic acid sample is comprised of human genomic DNA.

10. A method according to claims 1 to 9 wherein said
sample is treated by means of a solution of a bisulfite,
hydrogen sulfite or disulfite.

11. A method according to claims 1 to 10 wherein said
primer molecule comprises at least one nucleotide
within the last three nucleotides from the 3' end of
the molecule wherein said nucleotide is complementary
to a nucleotide of the target sequence that was
converted to a different nucleotide by the treatment
performed in step 2) of claim 1.

12. A method according to claims 1 to 10 wherein said
primer molecule comprises at least one nucleotide
within the last three nucleotides from the 3' end of
the molecule wherein said nucleotide is complementary
to a nucleotide of the target sequence that was
converted to a different nucleotide by bisulfite
treatment.

13. A method according to claims 1 to 12 wherein each of
said primer molecules is characterized in that at least
the last 5 bases at the 3' end of said primer
molecule are not complementary to the sequence of any
other primer molecule in the set.

14. A method according to claims 1 to 13 wherein the
number of mismatches allowed for when virtually testing
the amplification of unwanted products according to
step 3 c) of claim 1 is less than 20% of the number
of nucleotides of the primer molecule.

15. A method according to claims 1 to 13 wherein the
number of nucleotides creating one gap, when aligning
the primer molecule sequence with the template
sequence, allowed for, when virtually testing the
amplification of unwanted products according to step 3
c) of claim 1 is less than 20% of the number of
nucleotides of the primer molecule.

16. A method according to claims 1 to 13 wherein the
number of mismatches allowed for when virtually testing
the amplification of unwanted products according to
step 3 c) of claim 1 is less than 10% of the number
of nucleotides of the primer molecule.

17. A method according to claims 1 to 13 wherein the num- 
ber of nucleotides creating one gap, when aligning 
the primer molecule sequence with the template se- 
quence, allowed for, when virtually testing the am- 
plification of unwanted products according to step 3 
c) of claim 1 is less than 10% of the number of nu- 
cleotides of the primer molecule. 

18. A method according to claims 1 to 13 wherein the num- 
ber of mismatches allowed for when virtually testing 
the amplification of unwanted products according to 
step 3 c) of claim 1 is less than 5% of the number of 
nucleotides of the primer molecule. 

19. A method according to claims 1 to 13 wherein the num- 
ber of nucleotides creating one gap, when aligning 
the primer molecule sequence with the template se- 
quence, allowed for, when virtually testing the am- 
plification of unwanted products according to step 3 
c) of claim 1 is less than 5% of the number of nu- 
cleotides of the primer molecule. 

20. A method according to claims 1 to 13 wherein the
number of mismatches allowed for when virtually testing
the amplification of unwanted products according to
step 3 c) of claim 1 is less than seven.

21. A method according to claim 20 wherein the number of
mismatches allowed for is less than five.

22. A method according to claim 20 wherein the number of
mismatches allowed for is less than three.

23. A method according to claim 20 wherein the number of
mismatches allowed for is one.

24. A method according to claims 1 to 13 wherein the
number of mismatches allowed for when virtually testing
the amplification of unwanted products according to
step 3 c) of claim 1 is determined by a pre-specified
maximum melting temperature.

25. A method according to claims 1 to 24 wherein said 
primer molecules are used to amplify nucleic acid se- 
quences that prior to treatment of step 2 comprised 
of more than eight 5'-CG-3' sites. 

26. A method according to claims 1 to 24 wherein said 
primer molecules are used to amplify nucleic acid se- 
quences that prior to treatment of step 2 comprised 
of more than six 5'-CG-3' sites. 

27. A method according to claims 1 to 24 wherein said 
primer molecules are used to amplify nucleic acid se- 
quences that prior to treatment of step 2 comprised 
of more than four 5'-CG-3' sites. 

28. A method according to claims 1 to 24 wherein said
primer molecules are used to amplify nucleic acid se- 
quences that prior to treatment of step 2 comprised 
of more than two 5'-CG-3' sites. 

29. A method according to claims 1 to 28 wherein the
ability of said primer molecules to amplify an
unwanted product is tested by means of electronic PCR.

30. A method according to claims 1 to 28 wherein the
ability of said primer molecules to amplify an
unwanted product is tested by means of electronic PCR,
taking as template nucleic acid the coding strand of
the treated sample, the non-coding strand of the
treated sample and both of the strands of the
untreated sample.

31. A method according to claims 1 to 28 wherein the
ability of said primer molecules to amplify an
unwanted product is tested by means of electronic PCR,
taking as template nucleic acid the coding strand of
the bisulfite converted human genome, the non-coding
strand of the bisulfite converted human genome and
both of the strands of the untreated human genome.

32. A method according to claims 1 to 31 wherein said
primer molecules are used to amplify nucleic acids
which are comprised of at least 50 bp but not more
than 2000 bp.

33. A method according to claims 1 to 31 wherein said
primer molecules are used to amplify nucleic acids
which are comprised of at least 80 bp but not more
than 1000 bp.

34. A method according to claims 1 to 33 wherein said
primer molecules are comprised of 16 to 50
nucleotides.

35. A method according to claims 1 to 34 wherein said
primer molecules do not form dimers with each other.

36. A method according to claims 1 to 35 wherein said
primer molecules do not form loops or hairpin
structures.

37. A method according to claims 1 to 36 wherein said
primer molecules are complementary to target
sequences which prior to the treatment performed in
step 2) of claim 1 did not contain specified
restriction enzyme recognition sites.

38. A method according to claims 1 to 37 wherein said
primer molecules amplify regions of the treated
nucleic acids which prior to the treatment performed in
step 2) of claim 1 did not contain specified
restriction enzyme recognition sites.

39. A method for designing primers according to claim 1, 
comprising the steps of 

a) selecting a pool of possible primer pairs per am- 
plificate by means of a standard PCR primer design 
program using said nucleic acids as template 

b) excluding those primer pairs which comprise a
primer that in combination with another primer
molecule in the same set exceeds a threshold melting
temperature

c) excluding those primer pairs which comprise a
primer that does not reach a specified level of
complexity

d) excluding those primer pairs which comprise a
primer that in combination with another primer
molecule in the same set, under conditions allowing for
one or more base mismatches per primer, amplifies an
unwanted product when virtually tested using the
treated and the untreated sample nucleic acid as
template.

40. A method for designing said primer molecules
according to claim 1 and 43, adding the step of

e) excluding from the remaining confirmed primer 
pairs those pairs which in said amplification step do 
not result in the amplification of the intended prod- 
uct when performing a single PCR experiment. 

41. A method for designing primers according to claims 39
and 40, wherein said template nucleic acids are
masked for repeats and SNPs before designing said 
primer molecules and wherein said standard PCR primer 
design program considers one or more of the following 
factors 

length of amplificate, length of primer, melting tem- 
perature of the primers, dimer formation parameters, 
loop formation parameters, exclusion of unidentified 
or ambiguous nucleotides in the primer sequence, ex- 
clusion of restriction enzyme recognition sites. 

42. A method according to claims 39 to 41 wherein said
measure of complexity is a measure of linguistic
complexity.

43. A method according to claims 39 to 41 wherein said
measure of complexity is a measure of Shannon
entropy.

44. A method according to claims 39 to 43 wherein the
following step is carried out prior to performing
step d)

excluding from the remaining primer pairs those
pairs which comprise a primer molecule that
comprises at least one CpG site.



45. A method according to claims 39 to 44 wherein the 
following step is carried out prior to performing 
step d) 

excluding from the remaining primer pairs those
pairs which comprise a primer molecule that does
not contain at least one nucleotide within the last 
three nucleotides from the 3' end of the molecule 
wherein said nucleotide is complementary to a nucleo- 
tide of the target sequence that was converted to a 
different nucleotide by the treatment performed in 
step 2) of claim 1. 

46. A method according to claims 39 to 45 wherein the 
following step is carried out prior to performing 
step d) 

excluding from the remaining primer pairs those
pairs which comprise a primer molecule that
contains more than 5 bases at its 3' end that are
complementary to any other primer molecule's sequence in
the set.

47. A method according to claims 39 to 46 wherein the
following step is carried out prior to performing step
d)

excluding from the remaining primer pairs those
pairs which amplify a nucleic acid that did not,
prior to the treatment in step 2 of claim 1, contain
at least two CpG sites.

48. A method according to claims 39 to 47 wherein the
following step is added before performing step d)

excluding from the remaining primer pairs those
pairs which comprise one primer molecule that in
combination with another primer molecule in the set
amplifies an unwanted product, when virtually testing
according to step 3 c) of claim 1 under conditions
allowing for a number of mismatching nucleotides of
20% of the number of nucleotides of the primer
molecule.

49. A method according to claims 39 to 47 wherein the
following step is added before performing step d)

excluding from the remaining primer pairs those
pairs which comprise one primer molecule that in
combination with another primer molecule in the set
amplifies an unwanted product, when virtually testing
according to step 3 c) of claim 1 under conditions
allowing for a number of nucleotides creating one
gap, when aligning the primer molecule sequence with
the template sequence, of up to 20% of the number of
nucleotides of the primer molecule.

50. A method according to claims 39 to 47 wherein the
following step is added before performing step d)

excluding from the remaining primer pairs those
pairs which comprise one primer molecule that in
combination with another primer molecule in the set
amplifies an unwanted product, when virtually testing
according to step 3 c) of claim 1 under conditions
allowing for four or fewer mismatching base pairs.

51. A method according to claims 39 to 47 wherein the
following step is added before performing step d)

excluding from the remaining primer pairs those
pairs which comprise one primer molecule that in
combination with another primer molecule in the set
amplifies an unwanted product, when virtually testing
according to step 3 c) of claim 1 under conditions
allowing for two or fewer mismatching base pairs.

Abstract



The present invention relates to a method for the ampli- 
fication of nucleic acids comprising the following steps 

1) isolating a nucleic acid sample, 

2) treating said sample in a manner that differentiates 
between methylated and un-methylated cytosine bases 
within said sample, 

3) amplifying at least one target sequence, within said 
treated nucleic acid, by means of enzymatic amplification 
and a set of primer molecules, wherein said primer mole- 
cules are characterized in that, 

a) each primer molecule sequence reaches a predefined 
measure of complexity 

b) every combination of any two primer molecules in 
the set has a melting temperature below a specified 
threshold temperature 

c) every combination of two primer molecules, under 
conditions allowing for one or more base mismatches 
per primer, does not lead to the amplification of an 
unwanted product when virtually tested using the 
treated and the untreated sample nucleic acids as 
template 

and 

4) detecting said amplified target nucleic acid. 

A method is disclosed for amplifying nucleic acids, such
as DNA, by means of an enzymatic amplification step, such
as a polymerase chain reaction, specified for template
nucleic acids of low complexity, e.g. pre-treated DNA,
like but not limited to DNA pre-treated with bisulfite.
The invention is based on the use of specific
oligonucleotide primer molecules to solely amplify
specific pieces of DNA. It is disclosed how to optimize
the primer design for a PCR if the template DNA is of low
complexity.